A Robust Sequential Bayesian Method for Identification of Differentially Expressed Genes
نویسندگان
چکیده
A DNA microarray experiment simultaneously measures the expression levels of thousands of genes. An important question is to identify genes that express differentially between two types of tissues or at different experimental conditions. Since large numbers of genes are compared simultaneously, simple use of significance tests can easily lead to false positive findings. We propose a sequential procedure for estimating the empirical null distribution of multiple hypothesis testing and apply the procedure to identify differentially expressed genes in microarray experiments. Our procedure can be viewed as a new method to estimate the q-value proposed by Storey (2002). The key intuition is to obtain an estimate of the null distribution that is robust to the observations from the alternative distribution. Technically, we borrow strength from the missing data literature so that we can avoid estimating the density function corresponding to differentially expressed genes nonparametrically, but can focus on estimating the null density. Numerical comparisons between our method and Storey’s original method were conducted in simulated and real data examples. The numerical results show that our procedure outperforms the originally estimated q-values in almost all scenarios.
منابع مشابه
The Application of a Non-Radioactive DD-AFLP Method for Profiling of Aeluropus lagopoides Differentially Expressed Transcripts under Salinity or Drought Conditions
Aeluropus lagopoides is a salt and drought tolerant grass from Poaceae family, distributed widely in arid regions. There is almost no information about the genetics or genome of this close relative of wheat that stands harsh conditions of deserts. Differential Display Amplified fragment length polymorphism (DD-AFLP) led to the improvement of a non-radioactive method for which many parameters we...
متن کاملRNA-Seq Bayesian Network Exploration of Immune System in Bovine
Background: The stress is one of main factors effects on production system. Several factors (both genetic and environmental elements) regulate immune response to stress. Objectives: In order to determine the major immune system regulatory genes underlying stress responses, a learning Bayesian network approach for those regulatory genes was applied to RNA-...
متن کاملRobust Modeling of Differential Gene Expression Data Using Normal/Independent Distributions: A Bayesian Approach
In this paper, the problem of identifying differentially expressed genes under different conditions using gene expression microarray data, in the presence of outliers, is discussed. For this purpose, the robust modeling of gene expression data using some powerful distributions known as normal/independent distributions is considered. These distributions include the Student's t and normal distrib...
متن کاملIdentification and Functional Prediction of Long Non-Coding RNAs Responsive to Drought stress in Lens culinaris L.
Drought stress is one of the main environmental factors that affects growth and productivity of crop plants, including lentil. In the course of evolution evolution, crucial genetic regulations mediated by non-coding RNAs (ncRNAs) have emerged in plant in response to drought and other abiotic stresses. In the present study, after identifying lncRNAs within the expression profile of lentil, RNA-s...
متن کاملIdentification of key genes and pathways involved in vitiligo vulgaris by gene network analysis
Background and Aim: Vitiligo vulgaris is an acquired, chronic skin and hair condition characterized clinically by loss of melanin, which, if untreated, is typically progressive and irreversible. The aim of the present study was to identify potential genes involved in the pathogenesis of vitiligo. Methods: One dataset of mRNA expression in patients with vitiligo (GSE65127) were obtained from ...
متن کامل